TOM, a new temporal neural net architecture for speech signal processing
نویسندگان
چکیده
FOR SPEECH SIGNAL PROCESSING Stéphane Durand and Frédéric Alexandre Crin-CNRS / Inria Lorraine BP 239, F-54506 Vand÷uvre-les-Nancy Cedex Nancy, France [email protected] , [email protected] ABSTRACT The neural net model TOM (Temporal Organization Map) that we present in the paper is a new connectionist approach whose time representation is di erent from the one in classical temporal connectionist models. The architecture is neurobiologically inspired and is dedicated to sensory problems involving a temporal dimension. The basic idea of the TOM model is the propagation of an activity throughout the network whose elements are organized according to a map architecture. This propagation leads to a triggering of a sequence detection. We have applied this new kind of architecture to a spoken digit recognition problem. The results draw near to the results of the best Hidden Markov Model (HMM) techniques. The interest of such an architecture is its genericity and the possibility to merge several data ows in order to improve the classical performances of neural nets.
منابع مشابه
The gamma model--A new neural model for temporal processing
In this paper we develop the gamma neural model, a new neural net architecture for processing of temporal patterns. Time varying patterns are normally segmented into a sequence of static patterns that are successively presented to a neural net. In the approach presented here segmentation is avoided. Only current signal values are presented to the neural net, that adapts its own internal memory ...
متن کاملA Time-derivative Neural Net Architecture - an Alternative to the Time-delay Neural Net Architecture
Though the time-delay neural net architecture has been recently used in a number of speech recognition applications, it has the problem that it can not use longer temporal contexts because this increases the number of connection weights in the network. This is a serious bottleneck because the use of larger temporal contexts can improve the recognition performance. In this paper, a time-derivari...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملPhoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...
متن کاملNeural Network Classifiers for Speech Recognition
Neural nets offer an approach to computation thatmimics biological nervous systems. Algorithms based on neural nets have been proposed to address speech recognition tasks which humans perlorm with little apparent effort. In this paper, neural net classifiers are described and compared with conventional classification algorithms. Perceptron classifiers trained with a new algorithm, called back p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996